Google DeepMind Launches Gemma Scope 2: A Full-Stack Explainability Tool for the Gemma 3 Model
Google DeepMind launches Gemma Scope 2, an open explainability toolkit designed to analyze information processing at all levels of the Gemma 3 language model, ranging from 270 million to 2.7 billion parameters. The tool helps AI safety and alignment teams track internal features of the model to address issues such as jailbreaking, hallucinations, or inappropriate behavior.